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The small-world property in the context of complex networks implies structural benefits to the 
processes taking place within a network, such as optimal information transmission and robustness. 

In this paper, we study a model network of integrate-and-fire neurons that are subject to activity- 
dependent synaptic plasticity. We find the learning rule that gives rise to a small-world structure 
when the collective dynamics of the system reaches a critical state which is characterised by power- 
law distributions of activity clusters. Moreover, by analysing the motif profile of the networks, we 
observe that bidirectional connectivity is impaired by the effects of this type of plasticity. 

PACS numbers: 05.65.+b, 05.70.Fh, 87.19.La 


I. INTRODUCTION 

In their seminal paper, Watts and Strogatz [Tj de¬ 
scribed a class of networks that lie halfway between com¬ 
pletely random and regular networks. This class of net¬ 
works is characterised by a small average path length 
(which is typical for dense random networks) and an av¬ 
erage clustering coefficient significantly larger than ex¬ 
pected by chance (as in many regular lattices). However, 
the notion of a special class of small-world networks is 
somewhat misleading and it seems more appropriate to 
speak of a small-world, property, i.e. this property is not a 
binary but a gradual one and is stronger in some networks 
as compared to others as can be expressed by quantita¬ 
tive measures [2] . 

The small-world property offers a structural benefit to 
the processes taking place within the network, e.g. with 
respect to information transmission by speeding-up the 
communication among distant nodes in a regular net¬ 
work that can be reached via short-cuts in the small- 
world case. Moreover, this class of networks is a suit¬ 
able model in social, technological and biological contexts 
etc. [2j. For example, it has been observed that the elec¬ 
trical power grid network of western United States, the 
co-stardom network comprising movie actors and their 
collaborations and the neural network of the nematode 
worm C. elegans have the structural properties of low 
average path length along with a high average clustering 
coefficient pQ. 

In another seminal paper, Barabasi and Albert [3] pro¬ 
posed a model, known as the BA model , to explain the 
emergence of scale invariance in the degree distribution 
e.g. in the World Wide Web (WWW) [3]. In these scale- 
free networks the probability P(k) that a node connects 
to k other nodes follows a power-law P(k) ~ fc -7 [3]. It 
implies the existence of many poorly connected nodes co- 
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existing with very few but not negligible massively con¬ 
nected nodes hubs. Scale-invariant degree distribution 
and the small-world property are by no means exclusive 
and in fact many scale-invariant networks are also small- 
world. 

A look at the degree distribution of a network and the 
estimation of some network statistics like local clustering 
coefficients and shortest path lengths provide an insight¬ 
ful initial description of network structure. A more de¬ 
tailed picture of the network structure can be obtained 
by considering the relative frequency of particular con¬ 
figurations of small blocks in comparison to random net¬ 
works [3]. In particular, 3-node subgraphs, the motifs, 
describe the relationship among nodes taken in threes. 
Song et al. [6] observed that the motif distribution of 
acute slices taken from the visual cortex of rats exhibits 
an overrepresentation of some 3-node patterns when com¬ 
pared to random networks. Moreover, networks from a 
similar context share similarities in their motif profiles, 
so that these 3-node subgraphs might even define broad 
classes, or super-families, of networks urn- Similar motif 
distributions could point to similar dynamical processes 
in the network, e.g. food networks represent the flow of 
energy from bottom to top of the food web resulting in 
a particular motif configuration, whereas brain networks 
represent a flow of information without necessarily im¬ 
plying a particular direction of the flow, which results in 
a different motif profile [5]. 

When it comes to the dynamics occurring in a system 
that comprises a large number of simple elements inter¬ 
acting in a network, a large number of studies have been 
dedicated to the occurrence of power-law behaviour and 
its relationship with the concept of self-organised crit¬ 
icality (SOC) [5]. The concept of SOC has been sug¬ 
gested to explain the dynamics of phenomena as diverse 
as plate tectonics [5] , piles of granular matter m, forest 
fires El, neuronal avalanches m (see below), and mass 
extinctions [13], among several others. SOC implies the 
existence of a critical point that becomes an attractor in 
the collective dynamics of a system which resides thus 
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at the boundary between two different phases of the sys¬ 
tem (e.g. ‘order’ and ‘chaos’). SOC can be characterised 
by power-law event-size distributions, divergence of the 
correlation length, critical slowing-down of the decay of 
perturbation as well as the existence of avalanche-like 
causation of events in the system [ 8 ] . 

In neural systems, an activated neuron can trigger a 
chain reaction which takes the form of a cascade of activ¬ 
ity. These neuronal avalanches have been observed in bi¬ 
ological neurons m as well as in models (see Sect. |II A[ ). 
It has been reported that the distribution of avalanche 
sizes and avalanche durations can be approximated by 
power laws with resp. exponents 7 = —3/2 and <5 = 
—2 in the thermodynamic limit in globally-coupled net¬ 
works mm ■ Interestingly, the theoretical predictions 
are in good agreement with experiemental results in real 
brain tissue in local field potentials of cultured slices of 
rat cortex mi, and in the superficial cortical layers of 
awake, resting primates }16| . 


Critical dynamics of brain networks have been studied 
thoroughly in artificial models, and it has been found that 
the critical regime implies several computational benefits 
for the system, namely: optimal information transmis¬ 
sion and maximum dynamic range El, maximum infor¬ 
mation storage mm, stability of information transmis¬ 
sion ( 20 ], among others. 

An important area of research in network science is fo¬ 
cused on studying the mechanisms by which nodes con¬ 
nect and disconnect during the development of a network. 
Complex networks often possess feedback mechanisms by 
which the node dynamics and their interactions affect 
the structure of the network, which in turn alters the 
behaviour of the nodes etc. It is thus useful to consider 
both the dynamics on networks as well as the dynamics 
of networks [ 2111221 - 


In brain networks, synaptic plasticity refers to the 
structural changes that neuronal networks undergo 
through the strengthening or weakening of synaptic con¬ 
nections in response to the in-going activity. In partic¬ 
ular, spike-timing dependent plasticity (STDP) captures 
the existing temporal correlations between the spikes of 
pre- and post-synaptic units resulting in a temporally 
asymmetric learning rule that has been proposed as a 
mechanism for learning and memory in the brain 231 . 
STDP is a local rule that emphasises the precise timing of 
each individual spike, it also incorporates and extends the 
essential mechanism of Hebbian learning by including the 
notion of long-term potentiation (LTP) and long-term 
depression (LTD) of synapses based on the differences in 
activation times for pre- and post-synaptic neurons [24] , 


In this paper we study the effects of STDP mecha¬ 
nisms over systems comprising integrate-and-fire neurons 
poised at criticality and whose structures follow a com¬ 
plex topology. In the following sections we report how 
these two concepts combined, that of STDP and that 
of criticality, have an effect on structural properties of 
the network which are summarised in the concept of the 
small-world property. 


II. MODEL 
A. The Eurich model 

The model consists of N non-leaky integrate-and-fire 
nodes and was formulated for all-to-all connectivity m, 
but here we will also consider heterogeneous directed net¬ 
works. 

In the model, every node j is characterised by a mem¬ 
brane potential hj which is a continuous variable updated 
in discrete time according to the following equation: 

N 

hj(t + 1 ) = hj(t) + AijWijSi(t) + I ex t ( 1 ) 
i =1 

where A represents the adjacency matrix with entries 
A t j = 1 if node i sends an edge to node j, and A t j = 0 
otherwise; Wij denotes the synaptic strength from node 
i to node /; Si(t) G {1,0} represents the state of node 
i (active or quiescent, respectively) at time t ; and I ex t 
denotes external input which is supplied to a node de¬ 
pending on the state of the system at time t. If there 
is no activity at time t, then a node is chosen uniformly 
at random and its membrane potential is increased by 
a fixed amount through the variable I ext that represents 
the external driving of the system. When the potential 
exceeds a neural threshold, i.e. hi[t) > 9, the node be¬ 
comes active, Si(t) = 1. Afterwards, this node is reset, 
i.e. hi = 0 . 

In the simulations we study networks of different sizes 
and with the following network topologies: 

i. fully connected, 

ii. random, 

Hi. scale-free with low mean clustering coefficient (CC) 
and power-law out-degree distribution, 

iv. scale-free with high CC and power-law out-degree 
distribution, 

v. scale-free with low CC and power-law in-degree dis¬ 
tribution, 

vi. scale-free with high CC and power-law in-degree dis¬ 
tribution. 

It is worth mentioning that for the case of random and 
scale-free structures, the networks considered possess the 
same number of edges per system size, which results in 
the same average connectivity per network. In other 
words, as they have the same number of edges, their 
structure results from a permutation of the edges. For 
a random network, edges are inserted independently as 
in the Erdos-Renyi model [25], for scale-free networks we 
follow Ref. [26]. The latter algorithm achieves tunable 
clustering by performing a triad-formation step in addi¬ 
tion to growth and preferential attachment (which are 
the same as in the BA model 0)- 
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Thus, we consider two levels of mean clustering for 
scale-free networks (low and high) by tuning a simple 
parameter [2(3 . The small-world property is not a binary 
one, and as such, there exists a degree of what we would 
call small-world-ness. The process of tuning the mean 
clustering coefficient in our scale-free networks has an 
immediate effect in the degree of small-world-ness of such 
structures. 


B. Broadcasting hubs and absorbing hubs 

In directed networks, the in-degree and the out-degree 
distribution are generally not the same. Some directed 
networks possess a power-law distribution in both their 
in-degree and out-degree distributions, but with different 
exponents (e.g. the WWW [25]), whereas others have a 
power law only in one of the two directions (e.g. citation 
networks j25]). In either case, the presence of a long-tail 
in the out-degree distribution of a network implies the 
existence of broadcasting hubs, that is, nodes that have 
massive outgoing connections compared with other nodes 
in the system. On the contrary, the presence of a long-tail 
in the in-degree distribution implies the existence of ab¬ 
sorbing hubs, that is nodes that have a big amount of in¬ 
coming connections. Here, we are interested in analysing 
how collective dynamics develop for the case of networks 
with broadcasting hubs and for networks with absorbing 
hubs. 


C. Spike-timing dependent plasticity (STDP) 


As mentioned in Sect. [TJ STDP is a temporally asym¬ 
metric form of Hebbian learning induced by temporal cor¬ 
relations between pre- and postsynaptic neurons. Synap¬ 
tic weight between pre- and postsynaptic nodes is poten¬ 
tiated (increased), if the postsynaptic neuron fires shortly 
after the presynaptic neuron. It is depressed (decreased) 
if the opposite happens, namely the post-synaptic neu¬ 
ron fires shortly before the pre-synaptic neuron. We im¬ 
plemented STDP mechanisms in our model through the 
following set of equations: 


Wij (t + 1) = Wij (f) + Awij (At) (2) 


where At = t pos t ~ t pre denotes the difference between 
spikes of pre- and post-synaptic neurons, and: 


Awij(At) 


a p exp{ } if At > 0 
—a^exp{^} if At < 0 


where parameters a p and T p set the amount and dura¬ 
tion of LTP, whereas ad and Td set the amount and du¬ 
ration of LTD. In our experiments we set a p = ad = 0.1. 
Observations of STDP in brain tissue suggest that the 
time window for potentiation is typically shorter than 
the depression time window : 23] . for that reason we let 


T p = 10 time-steps and Td = 20 time-steps. However, it 
also has been observed that time-windows and amount 
of potentiation/depression vary across species and brain 
structures [ 23 ]. 

We impose hard bounds on synaptic weights, that is, 
0 < w m in < Wij < w m ax Vz , j, which prevents unbounded 
weight growth, gives rise to strong competition between 
inputs to a neuron and results in a bimodal distribution of 
the synaptic weights at the end of simulation time 1271 . In 
order to allow for activity-dependent pruning of synapses, 
we set w^ = 0, if w^ < w m i n following application of Eq. 
[2] If the connection is to be terminally deleted, we set 
also Aij = 0 once Wij = 0. 


D. Estimation of small-world-ness. 


Following the ideas in Ref. [2], we estimate the degree 
of small-world-ness in a network through the following 
process. Let G be a network consisting of n nodes and e 
edges. To test whether G exhibits the small-world prop¬ 
erty we construct a random network R with same number 
of nodes and same number of edges. Then, we estimate 
the mean clustering coefficient of both networks CCg 
and CGr along with their mean path length Lq and Lr. 
Finally, we compute S as the ratio of those values: 


CCg/CCr 

Lg/Lr 


(3) 


If S > 1, then G possesses the small-world property [2], 
which implies a more abundant presence of cliques and 
long-range connections among nodes than expected by 
chance. 


E. Numerical implementation 


When starting simulations, all membrane potentials 
are initialised at random, whereas states are set to inac¬ 
tive. The neural threshold 6 will always be set to unity. 
We let the system reach the critical state, which is identi¬ 
fied by a power-law approximation of the distribution of 
avalanches (see below), which implies that large events 
coexist with small events during running time. After¬ 
wards we apply STDP mechanisms to update the synap¬ 
tic strength among nodes based on their activity. At the 
end of simulations, we analyse the resulting network and 
the dynamics of the system. 

Both the relaxation time towards the critical state as 
well as the sampling time needed to assess criticality 
depend on the system size. We consider network sizes 
N = 128, 256 and 512 and choose the initial ‘settling’ 
time of the networks to, resp., 10 6 , 2 x 10 6 , 3 x 10 6 be¬ 
fore introducing STDP mechanisms in the system. This 
selection of times is appropriate for large events to take 
place during simulation time. 
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Results are fitted by power-law distributions and the 
quality of the fit is evaluated by the mean-squared devi¬ 
ation Ay. This is obtained based on the best-matching 
power-law exponent 7 from linear regression in log-log 
scales. Our choice of using this method is due to its sim¬ 
plicity and justified by the asymptotic unbiasedness of 
the estimation. When this error function is at its min¬ 
imum, that is, when the data is best approximated by 
a power-law distribution with exponent 7, is then when 
the system is at criticality. 

Moreover, following Ref. [251 we inspect the value of 
the largest eigenvalue of the matrices W associated to 
each network and whose entries Wij denote the synaptic 
weight between node i and j. The authors in Ref. [25] 
observe that the largest eigenvalue of the weight matrix 
governs the dynamics of the system. Through an an¬ 
alytical derivation, it is reported that when the largest 
eigenvalue equals unity the system is at the critical state. 

In Table [T] we report the value of the largest eigenvalue 
A of the weight matrices associated to our networks. In 
our experiments the critical state is not only identified by 
the power-law distribution of avalanche sizes (see Fig. [ 3 ]) 
before the onset of STDP mechanisms, but also by the 
value close to unity of A. 

After the system has reached such a state, we let STDP 
mechanisms to set in at the synaptic level, that is, at 
the level of individual coupling strengths. For all system 
sizes and topologies we allow four million time steps of 
STDP regime, after which the simulation is over and we 
proceed to analyse the structural changes of the system. 
Unlike the model in Ref. [25] we do not distinguish be¬ 
tween STDP-neurons and non-STDP-neurons, rather in 
our model every synapse in every unit is susceptible of 
plasticity. 

For our experiments we consider 50 different networks 
per class (b to d in Sect. II A) and system size for the sake 
of statistical robustness. In the case of fully-connected 
networks, as there exists only one fully-connected net¬ 
work of size iV, randomness is introduced in our code 
for each realisation of the experiment rather than in the 
structure as for the other network classes considered. Af¬ 
ter each realisation of the experiment we generat e 100 
random networks to compute the metric S (see Sect. IID) 
for each of the networks that result from the simulation. 
Experiments were carried out in the EDDIE computer 
cluster of the University of Edinburgh. 


III. RESULTS 

A. Small-world structure emerges in 
fully-connected networks 

We observe that after simulation time a small-world 
structure emerges from fully-connected networks as mea¬ 
sured by the metric S. In Fig. [l] we show the evolu¬ 
tion of S per time step for the fully-connected and ran¬ 
dom topologies considered. (We present mean values and 



timesteps (during STDP) 

(a) Fully-connected 



timesteps (during STDP) 


(b) Random 

FIG. 1: Evolution of small-world-ness measured by 
Eq. [3] for fully-connected (a) and random (b) networks. 

Small-world-ness increases due to STDP for 
fully-connected networks, which is not observed for the 
random networks. This more evident for smaller system 
sizes. We show mean values with error bars 
representing standard deviations. 


standard deviations estimated from the different realisa¬ 
tions of our experiment.) 

Unlike the fully-connected case, the random structure 
does not show any particular trend regarding the evolu¬ 
tion of S. Therefore, STDP does not imply any improve¬ 
ment in the structure of random networks. Moreover, as 
we can see in Fig. [TJ STDP acts faster in smaller systems 
than in larger ones, thus its effect is clearer in the net¬ 
works comprising 128 nodes than in those of 512 nodes. 
In any case, we observe a positive trend in the evolution 
of S. 

What is the effect of STDP in the scale-free networks 
considered? As mentioned previously, these structures 
already possess the small-world topology, that is S' > 1 
for all of them, however the degree of small-world-ness 
in them vary. We considered high and low values of 



































5 


the mean clustering coefficient for this type of networks, 
this results respectively in high and low degrees of srnall- 
world-ness in the networks considered. The reason we 
considered such classes is because we are interested in 
assessing how STDP mechanisms affect scale-free struc¬ 
tures with varying clustering coefficients. Additionally, 
as stated previously, we consider scale-free networks with 
broadcasting hubs and networks with absorbing hubs. 

For these topologies, STDP has a negative effect re¬ 
garding their small-world-ness, identified by a decrease 
in the value of S once STDP sets in. For example, in 
Fig. [2a] we show a comparison of the evolution of S for 
two scale-free networks with 512 nodes and low mean 
clustering coefficient (which implies low degree of small- 
world-ness). The network identified by the continuous 
line has a power-law out-degree distribution (which yields 
broadcasting hubs), whereas the dashed line represents 
its transpose, that is, a network with a power-law in¬ 
degree distribution (which yields absorbing hubs). As 
it can be seen, STDP affects negatively the small-world 
property in these networks. 

Similarly, Fig.[2b]traces the changes of S for two scale- 
free networks of same size as above but with high de¬ 
gree of small-world-ness. The continuous line represents 
a scale-free network with power-law out-degree distribu¬ 
tion, whereas the dashed line represents its transpose, a 
network with a power-law in-degree distribution. In the 
best of cases, S does not exhibit an noticeable decrease. 
These situations are verified in all other systems sizes 
considered for scale-free networks. 

It is clear that STDP does not have the same effects 
when the power-law of the degree distribution is present 
in the out-degree distribution or in the in-degree distri¬ 
bution; nor when the clustering coefficient of network is 
low or high. The trend, however, is not unambiguous. 
Nevertheless, we find that STDP affects negatively the 
degree of small-world-ness in scale-free networks. 


B. STDP impairs criticality 


As mentioned in Sect. m we assess the quality 
of the power-law approximation to the distribution of 
avalanches by estimating the deviation from the best 
power-law fit. When such an error function reaches a 
minimum value of less than 0.05, we consider the event- 
size distribution as well approximated by a power-law 
and conclude that the system is in a critical state. Af¬ 
ter the system has approached a power-law behaviour, 
we allow STDP mechanisms to set in. We observe that 
criticality is generally lessened while STDP is modifying 
the synaptic strengths. With STDP the difference be¬ 
tween the distribution of avalanche sizes and the best-fit 
power-law increases again, such that eventually a power- 


law ceases to appropriately model the system, see Fig. 2b 


A similar behaviour as shown there is observed in all the 
networks that we considered. 

The deterioration of the power-law approximation is 



timcstcps (during STDP) 

(a) Low CC 



(b) High CC 

FIG. 2: Effects of STDP on scale-free networks of size 
N = 512. STDP reduces the degree of small-world-ness 
in scale-free networks. This is observed across all 
system sizes and when the network has either low (a) or 
high (b) mean clustering coefficient. We compare the 
cases when the power law distribution is either in the 
out-degree (continuous line) or in the in-degree (dashed 
line) distribution. 


explained by the fact that once STDP sets in large 
avalanches cease to occur and only small avalanches take 
place. In this sense, it can be said that STDP favours 
local, clustered events. Fig. 2a shows an example of this 
behaviour. Here, the continuous line shows the system 
during criticality before STDP mechanisms take place, 
which can also be identified in the minimum value shown 
in Fig. 2b around the time step 2 x 10 6 . However, after 
STDP there are no more large avalanches to be added up 
to the distribution, only small ones, which results in the 
particular shape of the dashed line in Fig. [2a] As well, 
in Fig. 2b after STDP has set in right after the 2 x 10 6 
time step the deviation from a power-law distribution in¬ 
creases and we observe a larger error as time passes. 

As mentioned before, this same phenomenon is ob- 
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served across all networks considered in our model. This 
behaviour is captured by the largest eigenvalue of the 
weight matrix W denoted by A. Before STDP the value 
of A is close to unity. However, once STDP mechanisms 
set in, the synaptic modulation results in a weight matrix 
whose largest eigenvalue is less than unity, thus reflect¬ 
ing the deviation from criticality (see Table [i]) . From this 
we might conclude that in our model the critical state 
vanishes as STDP sets in. 

Does this imply that self-organised criticality and 
spike-timing-dependent plasticity are two biological phe¬ 
nomena that cannot coexist? The model proposed by 
Levina et al. m , based on dynamical synapses, ex¬ 
hibits critical behaviour that coexist with STDP. The 
modulation induced by such dynamical synapses results 
in a compensatory mechanism that recovers the critical 
regime. 


C. STDP prunes direct-feedback connections 

We analyse the motif profile of the networks after the 
STDP regime. There are 13 different motifs representing 
the possible relations among nodes taken in threes from a 
directed network. The different configurations are shown 
in Fig. [4j 

A motif profile shows the distribution of these 13 dif¬ 
ferent 3-node configurations for a single network. For 
example, before STDP, a fully-connected network pos¬ 
sess a single motif distribution: all 3-node relationships 
are of the XIII type (see Fig. |4|. However, after STDP 
the motif XIII breaks apart and instead motifs of the 
types I, II, IV, V and IX abound. Interestingly, none of 
these contain direct feedback connections. Fig. [5] shows 
the motif profiles of the fully-connected networks consid¬ 
ered (sizes 256 and 512 inside the inset) after STDP. For 
the smallest system size it is more evident how motifs /, 
II, IV, V and IX grow where previously there was only 
motif XIII. As the system grows, STDP requires more 
time to prune the network and profiles differ from one 
another. 

In the case of heterogeneous topologies, STDP attacks 
motifs with bidirectional connections affecting in this way 
the local clustering of the global structure, which in turn 
affects the degree of small-world-ness of the network. Mo¬ 
tifs with bidirectional connections are: III, VI, VII, VIII, 
X, XI, XII and XIII ; all of them impaired by the action 
of STDP. Fig. [6] shows this behaviour for our networks 
of 128 nodes. However, we observe this particular be¬ 
haviour in all the different system sizes considered. 

In particular, motif II is severely impaired in all cases 
considered. This motif represents the most basic feed¬ 
forward flow of information comprising 3 nodes, and as 
such the most elementary pre- and post-synaptic rela¬ 
tionship among three nodes, and as such it is expected 
to be a target for STDP. Interestingly, some motifs van¬ 
ish completely from the profile. These are motifs III, VI, 
VII, VIII, X, XI, XII and XIII; all of them involving 



(a) Distribution of avalanche sizes P(S) in log-log scales before 
and after STDP 



(b) Deviation from best-fit power-law per time-step 

FIG. 3: Effects of STDP on collective dynamics for a 
scale-free net with N = 256. Before STDP, small 
avalanches coexist with large ones, and their 
distribution can be approximated accurately by a 
power-law (a, continuous line), which is also identified 
by a small error around time-step 2 x 10 6 (b). However, 
after STDP, large avalanches cease to occur (a, dashed 
line) and the error exhibits an incremental trend for the 
rest of the simulation (b). 


direct feed-back connectivity. 

Therefore, in a sense it can be said that STDP prevents 
direct feedback connectivity, preferring indirect feedback 
flow in which a third node serves as intermediary. For ex¬ 
ample, motif X includes a direct feedback connection be¬ 
tween two nodes; however, after STDP this motif might 
transform into motif IX, in which the feedback flow is 
now mediated by a third node. 

STDP favours one-way connections rather than two- 
way connections, thus establishing a direction for the 
flow of stimuli given by the current activity in the sys¬ 
tem. Simply put, if node i sends an edge to node j, 
but this latter node fires shortly before the former most 
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FIG. 4: All configurations for 3-node connected 
subgraphs. 



FIG. 5: Post-STDP motif profile for fully connected 
networks of size N = 128. (Inset: sizes N = 256 and 
N = 512.) After STDP, motif XIII breaks apart and 
motifs I, II, IV, V and IX emerge. This is more evident 
for smaller system sizes, where STDP requires less time 
for pruning. 


of the time, then STDP acts by severing this somewhat 
erroneous connection. 


D. Other effects on topology 

Previously, we mentioned how STDP affects the de¬ 
gree of small-world-ness in the networks considered. The 
process of activity-dependent pruning has other conse¬ 
quences in topology, namely, the decrease in edge density 
and eventually, the disconnection of the network and the 
emergence of different connected components across the 
system. 

The density of a network is the ratio of the number 
of edges to the number of possible edges. As such, a 
fully-connected network possess all possible connections 
in the network resulting in the maximum possible value 
of the density, which is 1. As mentioned before, for any of 
the system sizes considered, the heterogeneous structures 
have the same number of edges, which yields the same 
density for all of them. However, how STDP affects this 
ratio varies depending on the particular type of network 
considered (fully-connected, random or scale-free). 



(a) outdegree-SF net with low CC (inset: its transpose). 



(b) outdegree-SF net with high CC (inset: its transpose). 

FIG. 6: Motif profiles for networks of size N = 128 pre- 
(blue) and post-STDP (orange) regime. STDP 
particularly affects motifs that contain direct feed-back 
connectivity (note that we are not assuming 
refractoriness in the neurons, otherwise higher-order 
motifs had to be considered). This is observed across all 
system sizes and when the network possess high (a) or 
low (b) mean clustering coefficient (CC), as well as 
when the network has an in-degree power-law 
distribution (insets). A similar behaviour is observed in 
random networks (not shown). 


Network density is an indicator of sparseness in a struc¬ 
ture. Our heterogeneous networks are sparse, a feature 
that has an effect on the way STDP acts upon the struc¬ 
ture. Fig. [7] shows the effects of STDP mechanisms on 
network density for all the fully-connected cases consid¬ 
ered and for the heterogeneous structures of size 128. We 
observe the same behaviour for the other system sizes 
considered. 

The effects of STDP are soon visible in a dense network 
topology. In a fully-connected network, STDP acts fast 
removing a considerable amount of edges as soon as it 
sets in. In contrast, for sparse-networks, either scale-free 
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FIG. 7: Density of fully-connected nets. STDP prunes 
edges here faster than in the other structures (see 

Fig.0. 


or random, STDP acts more slowly not even removing 
1% of the initial configuration under otherwise similar 
conditions. 

However, STDP does not act the same in every het¬ 
erogeneous topology. Scale-free networks in which the 
power-law is present in the in-degree distribution (that 
is, networks with absorbing hubs) lose density faster than 
any other, specially when the mean clustering coefficient 
is low. This can be explained by the fact that in in-degree 
scale-free networks absorbing hubs are more susceptible 
to fire, which in turn has a strong influence on STDP 
mechanisms. However, clustering seems to serve as a pro¬ 
tective mechanism to prevent edge pruning. This can be 
observed in the fact that our scale-free networks with low 
mean clustering coefficients lose edges faster than their 
high clustered counterparts. 

Nevertheless, out-degree scale-free networks (that is, 
networks with broadcasting hubs) decrease the size of 
their largest connected component faster than any other 
topology, specially when combined with high mean clus¬ 
tering coefficient. 

During simulation time, the effects of STDP pruning 
result on a disconnected network where components of 
different sizes emerge. All our observations regarding 
topology refer to the largest connected component (LCC) 
of the network. Because STDP behaves differently across 
the different topologies considered, we expect the LCC 
to be different as well. As we have just mentioned, the 
LCC of scale-free networks with broadcasting hubs suffer 
a shrinkage of size faster than random networks and scale- 
free networks with absorbing hubs, even if it is this latter 
type of networks the one that loses density faster than 
any other topology. This behaviour is shown in Fig. |8b| 
and it is observed across all system sizes considered. 

In summary, what is happening is that scale-free net¬ 
works with absorbing hubs lose edges faster than any 
other structure. However, these networks show more re¬ 
silience to become disconnected (Fig. 8a). On the con- 


(a) Network density 



(b) LCC size 

FIG. 8: STDP effects on network density and the size of 
the largest connected component (LCC) for 
heterogeneous structures of size N = 128. Unlike the 
fully-connected case, STDP prunes edges slowly in our 
heterogeneous structures, not even removing 1% of their 
edges by the end of the experiment. This can be 
explained by the sparseness of these networks. In-degree 
scale-free networks with low mean clustering coefficient 
(CC) lose edges faster than any other network (a), 
whereas the LCC of out-degree scale-free networks with 
high CC shrinks faster than any other structure. We 
observe this behavior across all system sizes. 


trary, scale-free networks with broadcasting hubs show 
more resilience to lose edges, but they are more prone to 
become disconnected when losing edges (Fig. 8b). 


E. Effects of spike triplets 

In this section we briefly discuss the effects of extending 
the implementation of STDP mechanisms from pairs of 
spikes to triplets of spikes. The computational implemen¬ 
tation of STDP described in Sect. Ill Cl takes into account 
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pairs of spikes to estimate the synaptic modulation at 
each connection, namely, one from the pre-synaptic node 
and another from the post-synaptic node, whose order 
determine the nature of the modulation. In the follow¬ 
ing, we refer to paired-wise STDP as pSTDP. 

It has been reported that pSTDP fails to replicate ob¬ 
servations in experimental data. Intuitively, a pre-post 
pairing followed by a post-pre pairing of the same magni¬ 
tude would cancel out any synaptic modulation triggered, 
however this is not what it has been observed in bio¬ 
logical experiments. In experiments, models of pSTDP 
are not able to explain the synaptic modulation trig¬ 
gered by higher-order plasticity rules such as triplets and 
quadruplets of spikes j30l [31]. For this reason, pSTDP 
has been extended in order to consider triplets of spikes 
rather than just pairs of them. In the following, we refer 
to triplet-wise STDP as tSTDP to differentiate it from 
pSTDP. 

A triplet rule for tSTDP involves sets of three spikes: 
two pre- and one postsynaptic, or one pre- and two post- 
synaptic spikes. With a triplet rule of this form it is pos¬ 
sible to fit experimental data from visual cortical slices as 
well as from hippocampal cultures. Interestingly, when 
this rule is based on Poisson spike trains, the learning rule 
can be mapped to a Bienestock-Cooper-Munro (BCM) 
learning rule [mi m- Moreover, it has been proposed 
that the triplet rule for STDP is a mechanism used by 
neuronal networks to perform computations that resem¬ 
ble those of independent component analysis (ICA) p2j . 

When we consider triplets of spikes for STDP, we might 
feel inclined to extend the model further in order to con¬ 
sider higher-order STDP rules. However, experiments 
show that spike triplets are able to reproduce data gen¬ 
erated by higher-order terms [501 . 

A model for tSTDP is implemented in the following 
way. The mechanism is similar to that of pSTDP in the 
sense that the difference between spike times in pre- and 
post-synaptic neurons determines the amount of synap¬ 
tic modulation, however one extra term is added to the 
weight update function. This extra term considers the 
temporal difference between the two most recent spikes 
of one of the spiking nodes. Hence, the spike triplet. Po¬ 
tentiation occurs in a similar way as in pSTDP, however 
the amount of synaptic modulation is also in function 
of the difference between the two most recent spikes of 
the post-synaptic unit. Therefore, this rule is identified 
as post-pre-post. Similarly, depression occurs analogously 
to pSTDP, but the synaptic update is in function of the 
temporal difference between the two most recent spikes 
of the pre-synaptic neuron. So, the rule is summarized as 
pre-post-pre. Figure [9] shows a schematic representation 
of this two rules. 


Thus, in order to extend pSTDP into tSTDP we add 
some extra terms to the equation for Amy (At) described 
in Sect. IIC which yields: 


p re Q 

POST (J 





LTP 




LTD 


FIG. 9: Mechanism of tSTDP. Potentiation is achieved 
when the pre-synaptic unit fires in between two 
post-synaptic spikes, whereas depression occurs when 
the post-synaptic unit fires in between two pre-synaptic 
spikes. 


Awij(At) 


a p exp{ j Tl } exp{ ^ T2 } if AT) > 0 
—ad exp{ } exp{ ~^ Ta } if ATi < 0 


where parameters a p and T p set the amount and duration 
of LTP, whereas and T^ set the amount and duration of 
LTD, as with pSTDP (see above); Ti represents the dif¬ 
ference between pre- and post-synaptic spikes, T 2 denotes 
the temporal difference between the two most immediate 
post-synaptic spikes (if ATi > 0) or the temporal dif¬ 
ference between the most immediate pre-synaptic spikes 
(if ATi < 0); and T x and T y are two parameters that in 
a similar fashion as with parameters Td and T p set the 
amount of influence of immediate spikes for depression 
and potentiation, respectively. 

With these ideas in mind we report the following ob¬ 
servations when considering tSTDP mechanisms in our 
model rather than simple pSTDP. Similar to pSTDP the 
synaptic modulation triggered by tSTDP mechanisms al¬ 
ter the quality of the power-law approximation to the 
distribution of avalanche sizes. As expected, this in turn 
has effects over the error function and the exponent of 
the power-law fit. This behaviour is also captured by the 
the largest eigenvalue A of the weight matrix W. Similar 
to pSTDP, when tSTDP sets in the synaptic modulation 
takes A away from unity reflecting the deviation from 
criticality induced by tSTDP (see Table |T|. 

However, although tSTDP has an effect on the mod¬ 
ulation of the synaptic strength between two connected 
nodes, this type of plasticity is less effective than pSTDP 
on severing connections; to the point that almost all 
edges survive after simulation time. Because of this, 
we do not observe any small-world structure emerging 
from fully-connected networks as we did with pSTDP. 
At criticality, tSTDP acts as a mild modulator of synap¬ 
tic weights based on the firing activity of pre- and post¬ 
synaptic nodes. 

Why tSTDP does not achieve the same amount of 
pruning as pSTDP? Let node i send an edge to node j; 
and let j spike before i. For potentiation to take place, a 
second spike from j to take place. Let us refer to the first 
spike of j as j 1 , and to its second spike as j' 2 , whereas 













10 


the the only spike of i will be denoted as i\. If node 
j’s first spike and node i’s only spike belong to the same 
avalanche we write j\,i\ £ Av\. As said above, for poten¬ 
tiation to occur, a second spike from j must take place. 
These leaves the following possibilities: 

1. ji,ii £ Av i and j -2 £ Av 2 - Here, for the maximum 
potentiation to occur the inter-avalanche interval 
must be very small. Such a regime, in which firing 
rate is high, is associated with super-criticality and 
not with criticality. 

2. j i £ Av i and £ Av 2 . Same as above. Here 
spikes are separated in two different avalanches that 
require to be as close as possible in order to poten¬ 
tiation be at its fullest. 

3- ji,h,h £ Av i- Here what is required is that the 
three spikes occur in the same avalanche. For this 
to happen node j must be active twice in a sin¬ 
gle avalanche giving place to a non-Hamiltonian 
avalanche, that is, an avalanche in which there is a 
node becomes active more than once. Although not 
shown here, this occurs naturally in scale-free and 
random networks at criticality due to the sparse¬ 
ness of the network. Therefore we expect this to 
happen in this type of networks and to be non¬ 
existent in fully-connected structures. 

We observe that the third case occurs in scale-free net¬ 
works and this will result in weights reaching the largest 
allowed value of synaptic weight ( w ma x ), whereas fully- 
connected networks fall in cases (1) and (2) described 
above, in which the weights do not settle in the maxi¬ 
mum synaptic weight and rather give rise to a bell-shaped 
distribution for the weights (not shown here). 

Let us consider the case where i fires before j. For 
depression to take place we require a second spike from 
node i. Similar to the case for potentiation, we have the 
following possibilities: 

1. ii,ji £ Av 1 and *2 £ Av-i- As with the cases pre¬ 
sented above, this situation requires a small inter¬ 
avalanche interval in order to get the maximum 
possible amount of synaptic depression. 

2. ii £ Av 1 and ji ,*2 £ Av 2 . Same as above. 

3. *1, Ji, i‘2 £ Avi. As with potentiation, this case 
requires non-Hamiltonian avalanches to be present 
in the system. As mentioned earlier, only scale- 
free and random estructures exhibit this kind of 
behaviour at criticality. 

Unlike potentiation, we do not observe a large amount 
of synaptic depression occurring in our networks, and 
consequently no synaptic pruning either. This is due to 
the asymmetry of the STDP learning rule, in which po¬ 
tentiation is benefited over depression plus the fact that 
the rule requires either a small inter-avalanche interval 
in the system or non-Hamiltonian avalanches. 


IV. DISCUSSION 


One important observation regarding small-world-ness 
in networks is that of the case of fully-connected topolo¬ 
gies. It can be argued that this particular type of struc¬ 
ture possesses the small-world property ab initio as it has 
both the maximum value of mean clustering coefficient 
and the lowest value of mean path length possibles, there¬ 
fore any alteration to its structure can only impair its de¬ 
gree of small-world-ness. This is a very valid observation. 
However, we should point out that a fully-connected net¬ 
work is a blank slate, in which no dynamic process has yet 
taken place. STDP carves a structure out of this topol¬ 
ogy (in particular, pSTDP), that changes the ratio of 
the mean path length and the mean clustering coefficient 
when compared to a random network with the same num¬ 
ber of nodes and edges. The resulting structure is better 
than the one that would have emerged from a process 
pruning edges randomly. Moreover, the value of S for a 
fully-connected structure is 1 always, as the comparison 
with a random network with same number of nodes and 
edges yields exactly the same fully-connected structure. 
However, we observe that after STDP the fully-connected 
network becomes a network in which clearly S > 1 (see 
Fig. la), which implies the presence of the small-world 
property. 

The model of Basalyga et al. j2S] is, unlike ours, not 
based on critical dynamics. They consider Erdos-Renyi 
networks of slightly more complex unit but only at a 
network size of N = 100 neurons, while we tried to con¬ 
sider scaling relations for system sizes larger than that. 
Since in there model only excitatory synapses are sub¬ 
ject to STDP, the topological effects are biased. While 
the main observation that STDP can impair criticality 
is reproduced here, we find a clear improvement of the 
small-world-ness of the emerging networks. 

Our analysis includes an inspection of the motif pro¬ 
file that resulted from STDP mechanisms. Song et al. [ 6 ] 
estimated the motif distribution of acute slices from the 
visual cortex of rats and observed that the motif profile 
of such networks differs from their random counterparts, 
in particular bidirectional connections were found to be 
more frequent than expected by chance. In contrast, 
we have obtained a slight decrease of bidiretional mo¬ 
tifs which is related to the particular form of the STDP 
rule used here [J3] when compared against a random net¬ 
work, which is not our case. As Song et al. claim, their 
counts are relative to random, whereas ours are abso¬ 
lute counts, that is, real counts which are not compared 
against a random network [ 6 ]. In their work, they present 
a ratio of actual counts to that predicted by their null hy¬ 
pothesis. In this context, bidirectional connections show 
an overrepresentation, a situation which is not in clash 
with our observations. When the ratio of motifs con¬ 
taining bidirectional connections is compared to that of 
non-bidirectional connections in the same network pro¬ 
file, we observe that the latter is much larger than the 
former, an observation that is in agreement with ours. If 
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we compare the resulting state of our model to the state 
before STDP was applied, we find that many bidirec¬ 
tional connections have been deleted. Moreover, it can 
be said that from a certain point of view, their observa¬ 
tions are just snapshots of a dynamic process happening 
in the networks that they considered. Such a process 
would have another effect in the long run, in which bidi¬ 
rectional connections are found more rarely than before. 
Also, these authors do not mention anything related to 
the type of dynamic process that gave rise to the struc¬ 
tures that they observed. Thus, the question regarding 
STDP as a mechanism that prunes two-way connections 
is not settled by their work. 

Ex nihilo nihil fit. The brain is shaped not only by 
genetics but also by activity-dependent processes during 
development. It is known that several brain structures 
possess the features commonly associated to small-world 
networks [341 . In particular, the only neural network that 
has been mapped in its entirety, namely the nervous sys¬ 


tem of the nematode C. elegans , is known to possess the 
small-world property [Tj. In higher organisms, genetic 
processes cannot fully account for the existence of this 
particular network structure, as during the lifetime of an 
individual, modification of the network are present due 
to activity-dependent processes, e.g. in connection to 
learning and memory. Here, we claim that massively con¬ 
nected structures combined with critical dynamics can 
give rise to a small-world structure already if a standard 
STDP rule is in place for adapting the network towards 
a better-than-random structure which is beneficial for in¬ 
formation transmission across the system. 
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